Project Development Analysis of the OSS Community Using ST Mining

نویسندگان

  • Yongqin Gao
  • Greg Madey
چکیده

The OSS (Open Source Software) phenomenon is a novel, widely growing approach to develop both applications and infrastructure software recently. The fast growth of the community increases the interests in OSS related research. Accurate prediction of the project success is one of the interesting studies in OSS research. We propose to use the ST (Spatial Temporal) data mining techniques to predict the project success in the OSS community. ST mining has been studied in Euclidean distance based spatial systems like GIS, but to date has only received little attention in non-Euclidean network structured evolving system like the OSS community. In this paper, we introduce novel methods to project the evolving OSS community in a spatio-temporal data set and related ST mining algorithms to process the data set. Using ST mining techniques we propose, we are able to get the prediction of project success in the OSS community. We also present a detailed analysis and experimentally demonstrate the effectiveness and efficiency of these techniques in a real OSS community – SourceForge.net. The results show that our techniques can predict the project success and they are also useful in other non-Euclidean spatial systems. Contact: Yongqin Gao Dept. of Computer Science and Engineering University of Notre Dame Notre Dame, IN 46556 Tel: 1-574-631-7596 Fax: 1-574-631-9260 Email: [email protected]

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recovering Valuable Information Behaviour from OSS Contributors: An Exploratory Study

Context. Distributed software development is currently a modern practice in software industry. This is especially true in Open Source Software (OSS) development community. Understanding how developers’ practices are on those projects may guide communities to successfully manage their projects. Goal. We mined two repositories of the Apache Httpd project in order to gather information about its d...

متن کامل

Understanding Contributor to Developer Turnover Patterns in OSS Projects: A Case Study of Apache Projects

OSS projects are dynamic in nature. Developers contribute to a project for a certain period of time and later leaves the project or join other projects of high interest. Hence, the OSS community always welcome members who can attain the role of a developer in a project. In this paper, we investigate contributions made by members who have attained the role of a developer. In particular, we study...

متن کامل

An Adaptive Filter-Framework for the Quality Improvement of Open-Source Software Analysis

Knowledge mining in Open-Source Software (OSS) brings a great benefit for software engineering (SE). The researchers discover, investigate, and even simulate the organization of development processes within open-source communities in order to understand the community-oriented organization and to transform its advantages into conventional SE projects. Despite a great number of different studies ...

متن کامل

Predicting OSS Development Success: A Data Mining Approach

Open Source Software (OSS) has reached new levels of sophistication and acceptance by users and commercial software vendors. This research creates tests and validates a model for predicting successful development of OSS projects. Widely available archival data was used for OSS projects from Sourceforge. net. The data is analyzed with multiple Data Mining techniques. Initially three competing mo...

متن کامل

Antecedents of open source software defects: A data mining approach to model formulation, validation and testing

This paper develops tests and validates a model for the antecedents of open source software (OSS) defects, using Data and Text Mining. The public archives of OSS projects are used to access historical data on over 5,000 active and mature OSS projects. Using domain knowledge and exploratory analysis, a wide range of variables is identified from the process, product, resource, and end-user charac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005